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RvC/fi Is 3 CONVERT Ftfopranj realized ifl tilt CTS5 H&P of 
Project MM!* lor compiling SEC expressions into the machine language 
tif the PUI’-B computer, Since the c cun J1 ati0« Consists in it* majority 
of subroutine calls (to be compiled^ after removal of LISP parentheass 
by MACPIVfTi the technique is applicable with trivial modification to 
any other computer hiving the subroutine Jimp and indirect transfer 
instructions. The purpose of the program is both tc compile RFC 
expressions and to illustrate the verting* of the 1SEC language, act! 
accordingly n duacTlption of this language is Riven, It contains 
operators and predicates^ flow of control Is achieved by parentheses 
which define subexpressions, colon which imp lies iteration t and 
semicolon which tarmln-Hte* the execution of an expression* Predicates 
pass control to the position foil wine the next colon or semi colon , 
allowing the eiecMion of alternative expression strings* 
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REiC (RCCllUR EGRESSION COMPILER) is a programming language of 
Simple itnictun dove loped originally for the EW-8 computer, but 
readily adaptable- to any other general purpose computer* It has been 
used extensively in teaching Algebra and Numerical Analysis in the 
ESFM, even for pfegr-amming hand calculations with the friden electronic 
desk calculator* In rather vag.Ln; terms, it derives its appeal from the 
fact that computers can he regarded in ana Way or another 05 Turing 
Machines with, very elaborate built-in shortcuts to eliminate the 
grotesque inefficiency of manipulating individual hits on a single 
linear tape, A turing Machine consists of a finite state machine acting 
a% the control of a tape memory; finite state machines in turn are 
conveniently described by regular expressions* The REC notation is a 
manner of writing regular expressions mor amenable to programming the 
Turing Machine which they control* If one does not wish to think so 
strictly in terns of Turing Machines, REC expressions Still provide a means 
of defining the flew of control in a program, which is quite convenient 
in many applications* 

Let E be an alphabet, which presumably would not contain among 
its letters the operational signs which we shall introduce, We then 
define a REC expression recursively in the following manner, 

1) 1 is a REC expression 

ii] {) is a REC expression 

Hi) if.tiul: ; »>, „ is 9 rac 

iv) if n and fl are REC expressions, so is 

v) if a is a REC expression, so is (a) 

The operational signs are used as follows, Parentheses are used 
to denote a single expression* Concatenation is implied by writing 
expressions in sequence* Colon [;J implies iteration of all the 
expression which precedes the colon* Semicolon [;] terminates the 
concatenation of a string. The large period [*] indicates a choice 
between continuing to concatinate the following expressions or to pass 
over theE until the naxt following Obion Or Semicolon (if any] of the 
same parenthesis level is reached* Such a choice is always implied 
following a parenthesised expression* 


It is to he noted that parenthesas Siave a very technical use in 
REC expressions, and ate more than simple signs of prouping, Thus, since 
eoncatinatiOn is asjuei ntivo „ it is always written in its extended form 
without parentheses. When some grouping is desired to he sit own , some other 
symbol, such as square brackets, should be used. The notl-associativity of 
FLEC parenthesiEfltlOfi is often exploited 10 achieve some economy Or simplification 
of expression. 

To see the ooTTaspondcr.cc between regular expressions and REC 
expressions, we first show how any regular expression is to he written 
as a RLiC expression. 

■t + (} 

■> A + \ 

ff + a 

fl/3 ■+■ a 0 

n □ ft 3 ;) 

For the converse process of writ in g the regular expression 
corresponding to a REC expression, it is more convenient to show hew 
to use a REC expression to construct a transition system, whoso regular 
expression (or class of equal regular expressions) may then be deduced. 

The algorithm is as follows, recursively defined, 

1 ) For every PEC expression there will ba an initial 

state and two final states, labelled T and F. 

2) A REC expression is to be road from loft to right, but 

any quantity appearing within parentheses its to 
00 treated as a single expression, recursively, 

. with additional rules governing hew to join its 

initial and final states to those outside the 
parentheses, 

S} if o e Z is aeon, draw an arrow labelled o from the last 
state to a new state, 

A} If i is seen, do the same with a spontaneous transition. 

5) if : is seen, draw an arrow representing a spontaneous 

transition back to the initial state* 

6) If $ is seen, draw an arrow representing a spontaneous 

transition to the final state T, 

7) hbenever either 3 dr 3 is soon, a new state Should bo femed, 

-If * is seen, draw an arrow representing a spontaneous 

transit!an to the state immediately following the next 
: or ;, if any; otherwise to the final state T» 


If a parenthesized expression is seen* apply the 
„ entire algorithm to the parenthesized express! on. 

When this is done, draw an arrow rep resent !ng a 
+ spontaneous transition from the last state to the 

initial state of the enclosed expression. The 
last state written is too final state F of tho 
enclosed expression,, and is to be connected by & 
spontaneous transition to a subsequent state in 
exactly the same manner as had an „ been seen. 

The final state T is to be used as the current 
State in continuing- to process the outer level. 

Id) The last State to be written is the final state F of 

the outer expression; or should be connected to it 
by a spontaneous trims t£ cm if the latter has 
already been prepared* The final state T is the 
accepting state of the transition system. 

As an example of the process, let us consider the transition 
System which we would produce from the HE!C expression 

* C* P- i q* (it q.:0 sw ; ) 

by following the above rules* 



It will be seen that the three F states are all isolated because 
according to this particular REC expression there is on way to arrive at 
any of theft. 

It will be noted that the ESC expressions which are derived from 
regular expressions by the prescription we have offered for* a limited 
eiass among the possible REC expressions* In part this is due to a 
dflsira to leave the series of the EEC expressions relatively we;*, evon 
though it admits a great nunber of expiations which would produce useless 
transition diagrams; for instance we do not exclude the sequence 



fiut there is also the era si deration that although regular expressions 

,ire defined with binary ccr.ru?cti ves „ such as union and cone atinatiqn , 

those connectives are associativa and are gengraily written in 

P* 1 * a tithes 1 1 free form, Although BBC paronthe&os are not associative, 

there nevertheless exist cravenient n-ary forms, equivalent to their 

correspond inn binary forms * For examplE, in a triple union one could. 

write 

tA u is) u c as U{*A;&i);e:) 

A li (B u C) AS £«A*£*&;£;)j) 

but preferable to both is writing 

A u E |] c AS ( a A; n S;C) 

with a similar notation corresponding to a. more oxtGnsive union. 

In fact, such merit as there might be to the REG notation arises from the 
fact that although it might be somewhat Cumbersome to make direct 
transcriptions of regular expressions, there will be a Consequential class 
of expressions which wo will wish to write; whoso REC form will be simpler 
and more convenient than the corresponding regular expressions. Thus the 
correspondences which we have established serves to demonstrate that the 
totality of REC expressions is no morn nor leas general than the totality 
of regular Expressions, 

Since tha intention of JiEC expressions is to control the ope rati on 
of a general purpose computer £hr mere specifically a Turing Muhina), w & 
will expect the letters of the REC alphabet to represent individual 
operations of which the machine is capable. For this reason the letters 
will be called operators , tfOTds of the tet alphabet win then correspond 
to sequences of operations, carried out in the order given. The transition 
system derived from a REC expression will then accept a word of this 
alphabet if it corresponds to a possible saTjes of operations which could be 
carried ait during the calculation in question. Ip the case of a Turing 
Machine, the operators will be to write a symbol, compare a symbol, nova the 
tape left or move the tape right. But the operators will have to be chosen 
according to the circumstances. 

In reality we are not so much interested in recognising a possible 
calculation as in prescribing the particular one Which W0 want among all 
those possible. It Is for this purpose that the large period [<,] was 
introduced, which is related to the operation of union in a regular 
expression* At each place in a REC express ion where „ occurs, there 
is a spontaneous transition in the transition diagram* indicating the 
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possibl lity of a selection among two alternatives; to continue the 
popular sequence* or to start a new one by following the spontaneous 
transition* To specif/ fl particular word among ail those represented 
by a given EEC expression* it is only necessary to specify this choice 
at each place where it becomes possible* h'e night oven assume that 
there are special operators whoso purpose is to make this choice* They 
arc called Pred 1CateS * and will always coiri)ino too symbol implicitly, 
Tims a predicate is a combination of hh appropriate Ope rut or followed by 
the symbol flJ . We will mqreqvTe say that a predicate takes the value 
true or false according to whether the decision is made to continue 
in the rajrular sequence or to follow the spontaneous transition pa:.' 
the nearest colon or semicolon- Every parenthesized expression is 
automatically assumed to be a predicate* although analysis nay show that 
it is Only capable of assuming one of the two possible values* Such 
was the case in cur example. 

The transition diagrams of REC expressions have two final states 
to accomodate their usage as predicates* Thus a calculation definitely 
fails * definitely succeeds* or else is in progress. Moreover the EEC 
notation has been particularly chosen to Facilitate tire formation of 
Boolean combinations of its subexpressions* Thus the combination AND of 
the predicates a* b* c* **.„* n is written 

(abc *.. n;) t 

a notation which is valid for any number of arguments* Thns (;) always 
is a true predicate* whilst a = fail < 

The Combination OR qf these sane predicates would be Written 
(a: b; c; iu) ( 

which again holds for any [writer of arguments* Q is a predicate which 
Is always false* and as before, (a;) = a. 

The coaplenient qf the predicate jt is written 
(*)■ 

We accordingly always have x (fx)]* 

A typical REC expression will begin with a series of operators* 
followed by a predicate which will decide typical questions such as 
whether the calculation is finished and be followed by ;* or whether to 
repeat the whole procedure and be followed by :* When thes* conditions 
fail* there will follow further calculation, expressed by a series of 
operators, and yet another predicate, CXie executes as much of a string as 




lie can until he meets a die limiter, an<l es many strings as tteoossaiy to 
Swot a tominal ton Jit ion. One practical caution which has to bo 
observed is that if several predicates occur in a string# and one has 
reached the end of the string, the AND of ell these predicates is true* 
if one arrives beyond a colon or semicolon# indicating the string has 
failed# he only knows tho AND has- failed# but not which individual 
predicate* This requires either a new test of some of the predicates# or 
a nore cautious rewriting of the me expression* It is one situation in 
which one sometimes wishes there were a more direct control of the flow of 
control in a R£C expression; perhaps by moans of labels and "CO Ws*'" 

Td give SMnn very simple examples of the application of REC# let us 
bear in mind the PUP-8 computer# which has a teletype coded for hi asl 4I 
characters in direct communication with the central processor„ Let R 
bo the operator which reads one such character# either frpm papor tape 
or punched by hand «, the keyboard# and W be the operator which Sends rme 
such character to the teletype* The characters are kept in a workspace 
(the accumulator# say)# and we may imagine 6.1 operators of the type "x 
which plate the character 1 In this workspace erasing the previous 
contents# as veil as t4 predicates which test the workspace for 
equality to the character X* 

The PEC expression 

tR “E; V , 1 * W;) 

Will doublcspace everything which it reads# until the exclamation point 
is encountered and it terminates Operation, 

Let us say that we wish to ignore all text which occurs between 
two stars, An appropriate expression will he 

(R »]; «- (R **;:) t Ht) 

and again it will terminate when an exclamation point is encountered in tha 
printing text. 

By including operators for the binary conversion cf decimal input 
and output# the arithmetic operations# and a test for negative numbers# ona 
COUld formulate EEC expressions for arithmetic calculations. The domain of 
applicability of REC depends Upon its complement of operators and predicates- 
however at present it is only the control structure which interests us. 
Although we describing A Compiler of REC for the PDf-S# the 
description is applicable to the majority of machines because the Compilation 
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ifi Rade entirely in terras of subroutine tills J except: for the part which 
corresponds to HEC's own flow of centre 1, which is realised for the mast 
part by appropriate transfers* 

In the IW-a, a subroutine call is made by means of tSja Instruction 
.IMS [Jump to Subroutine) t Lot us suppose we have the coding configuration 

X, JHS V 

* 4 4 

■ i * 

Y, 00 

■ -i ■ 

jHf* r v 

When the instruction JMS Y, located at address X, is executed, the address 
X*i is stored at Y, and transver is mado to Y+I* ifhen the subroutine is 
terminated, this is dona by the instruction JHP T Y, an indirect transfer 

to Y Which is a transfer to »1, so that the original program is resumed 
in sequence, 

Data Of use to the subroutine Y may be located at addresses X+l, 

*+2 t and so on, and may be accessed indirectly through the address stored 
at location Y. By applying the instruction Isz V, (Increment and ship on 
lero), this data may bg gathered item by item. Moreover, the subroutine 
Y can serve as a predicate, since an ISZ preceding the return jump can 
cause a sfcip to X+2 rather than a return to x+i* 

Jn this way, the predicate, «x, may be treated as a composite 
predicate, footed from a E*w*ral subroutine EQ r which uses the character 
X as a parameter in the calling sequence* *x would then compile into 

JMS EQ 

=3 

(return false} 

(return trite) 

Clearly, this pattern accounts for predicates with miltipla parameters, 
including none/ the falsa return will contain a transfer, corresponding to 
tho spontaneous transit!on of the transition diagram which the REC expression 
defines, while for the true return there will occur further subroutine jumps 
correspondimp to subsequent operators. 

With these preliminaries we may now turn to tho CCSIVEkT program 
EtEC, an annotated listing of which we give below* 


DEFINE (( 


(REC (LAMBDA (L) (PRIHTLIST (COfiVEUT 
(QUOTE ( 

OF PAV (-OR* 1? V?) 

ftn 1 the |in^wi! of th r"p fhiv^ crfaAGiMi of lattero tsiw distinguishcdi 

(OP}, i’/WiiidtCii (TtfJ ttnJ ^JVcJr'ji]£[fj (£F) t In vaja 

category ita members are lintel and t rented as PAV *8 by the CONVERT p mg ran r, 

PR PAV (-OFL- Q) 

CP PAV (■□]?* EQ QU) 

)) 

(QUOTE ( 

X (XXX) 

)) 

L 

(QUOTE (*0 ( 

(PR ((JUS PI!) (JMP FA))) 

Predicates arc compiled a& a subroutine call follwsd by a transfer to FA* 

FA is the heading corresponding to the FALSE exit of tike eapruartt; under 
compilation, S^ii-G transfer* is skipped over U hen the predicate is true, 

(OP ((JMS OP})) 

Gpeni tore are competed by a atJnpZa tfukron£iF3e ca££* 

((CP X) f(JMS CP) (X) (,MR FA))) 

^an^pOliFStf Predicates are compiled as Fre di-oate-S, but their parameter is 
included as part of the calling sequence, 

({-*) (CJMP or) FA)} 

Tne COE VEST program) is written in such a Way that it does not distinguish 
CDR of a Hat from a Hst* Rauaver, these Aai^e to be processed differently, 
and are therefore distinguished by a double asterisk placed in front of 
a fragment whioh has arisen as CDR of a list. When only the double 
asterisk is left,, the of the list has been reached, £Ae spontaneous 
trmsition IJMP OF) aarrespamfiMp to the foot that each parenthesized EEC 
expression is regarded as a predicate is insarteci, and the heading FA is 
placed, since we haps new arrived at tAe first state outside the parenthesis 
to which all false exits in the last segment must precede, 

((+* CO XXX) [ (JMP HE) FA (^SKEL*- FA E1CPR «CNSf» («BEPT* (-* XXX))))) 
When, in examining a RFC expression element by element, we arrive at q colon 
fCTSS DOFS NOT LET US WRITE ALL CHARACTERS, AN TDI0S7NCRACY OF THE LISP INPUT 
ROUTINE), We Write a Spontaneous transition to the initial State fJMP HE}, 
note the false exit point of all predicates tM tAa previous segJ-^snt, and define 
a Meu? false exit potrifc for ensuing segment* The analysis continues with the 
remainder of the REC expression, 44 serving as a signal fcAflf we do Mai deal with 
a new expression*, 

£(*+ SC XXX) ( (-TMF TR) FA (*SKFL 4 FA £XPR ■CN5Y- (»HEPT* (** XXX))}}) 
When a semicolon is encountered, a jrpaMiaaeaiiG fransttian £a ^rcaV to the TRUE 
final state fJtiP TR) t tAe false exit point of alt predicates £h the prsuiOMS 
ff#pnvpjt is noted, and a false exit point is established for the ensuing 
The analysis iAett precedes with the rejoinder of tAa expression* 

((** X XXX) (£*I!EI*r* X) (*REPT* (** XXX)))) 

If neither delimiter is encountered, we otmpile the CAR tAatt the CDR of the 
expression , Car’s and EDR's are not treated uniformly because a new tfllit'&Z 
atatef has to be established for cash g Ube ^ re<ra £ on, but not for each CDR r 

((■«) (-SKEL- HE EXPfl -GNSV- TR -EXPR-- -CN$Y- OF EXPF FA 

(HE (*SKEL* FA EXPE *GNSY= (*ft£PT- (** # 5AME*))) TEl))) 


* JH oempiUng a parenthesised expression, provision iwu^t be made for the 
initial Hate, TRUE final state, and FALSE final state, all of vhich are 

n^-Jiko-c as OERSIm's* These labels frhtft be included nt appropriate points 
in the aonviled code* f t 

))) 

)))) 


( LAMB ™ O) (prog [V) (setq y x) (clock ()} a (print (Car y \\ 

(SETq Y (CDR Y)) (CONI) ((NULL Y) (RETURN (CLOCK T))}) {GO A)))) 

mjjTLI&F.ie an auxiliary function xhiah allots listing the compiled program 
tilth on* FDP-&'instruction per lend, rather than aa a eof/pdai list in the 
ufua^ rwjjgj" that LISP Mould print tf result* 

r } 


As an example of the operation. of JlEC k'e mjjr consider the fpllowlnj; 
example, (REC L> is a function whose argument is, the ROC expression which 
is t P he compiled. On Account of inherent limitations in the orthography 
of the CTSS Lisp input routine* certain substitutions had to he made; 

SC for ;* CO fer (EQ X) for -K ( [qu k) for "X. 


rec ££r (eq 

G03163 
[JMS R) 

(JMS EQ) 

(-) 

(JMP GO 3165) 
(JMP CQ31G3) 
5t>Ji6S 
(JMS EQ) 

(:) 

(JMP GQ3166) 
(JMP G03l6<l) 
C031&6 
(,INS W) 

(JMS R) 

(JMS Q) 

(JMP G03167) 
£JWS H) 

(JMS R) 

(JMS Q) 

(JMP G03A67) 
(JMS W) 

(JMS R) 

{JMS Q) 

(JMP £303167) 
(JMS W) 

(JMP FA) 

GQ3167 
GO i 164 
5 


-) co (eq :) 3t « r q * r i| v : q w)) 
R 


parameter 

fates 

* 

fates esrit of last ssgmant 

parameter 

false 


false exit of lost segment 
R 

Q is m\ ctrbi tr.ury predicate 
fates exit 
W 
R 
Q 

f 

w 


R 

Q 

f 

w 


\ 



exit from last Segment 
aenttHuatLjpn eni higher 
TRUE final State, exit 
(time of execution) 


to FALSE final State 
least.- r exit of F r s in last 
of all semicolons 
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The prej^rarr which is .generated is incKiplete in the sense that it. 
itself stimilJ he finished aff as a subroutine, with, a Hant entry paint 
bearing an appropriate label + .and terminated with appropriate ISZ’s and 
,D1P I f s. 

For case of reference we Conclude with an ua annotated lilting of the 

praBtm. 


DEFINE (( 


(REC (LAM3DA (L) 

(PRINTL1ST (CONVERT 

Cqucrre ( 



OP 

PAY 

(“Oft* ft ft) 

PR 

PAY 

(*or* q) 

CP 

PAY 

(“OR* EQ qu) 

» 



(QUOTE ( 



X (XXX) 



n 

L 



(QUOTE (*0 ( 



m 


((JMS PR) (.IMP FA))) 

(OP 


({JMS OP))) 

((CP X} 


C(JK3 CP) (X) (JWP FA))) 

<(**) 


((JHP OP) FA)) 

((** CO XXX) 

CCJMP JIE) FA ('GKEL* FA i 

{{** SC XXX) 

(CJMPTR) FA (*SKKL* FA i 

{{** * 

XXX) 

£(*REPT* X) ('KEPT* (** 

({—) 


(-SKEL- HE EXPR nGNSY* 


)J) 

)») 


(HE ('SltEL* 


EXPK *GNSY« (*SEPT* (** XXX)])}) 
EXPH *GNS¥- (»REPT» {** SCXxJ ) >55 
XXX)3>3 

TR fcXPR -GMSV- OF EXPS FA 
FA EKPR *GNSY* (-REFT* (** *5AHE*))) TR])) 


(FflINTLIST (LAMBDA (X) (FROG (¥} (SETq Y X) (CLOCK (>) A (PRINT (CAR Y)} 
(SETCl Y (COR *)) (COND ((NULL Y) (RETURN (CLOCK T))])■ (GO A)))) 

)) 
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